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Abstract. - We investigate the electrical current and flow (number of parallel paths) between two 
sets of n sources and n sinks in complex networks. We derive analytical formulas for the average 
current and flow as a function of n. We show that for small n, increasing n improves the total 
transport in the network, while for large n bottlenecks begin to form. For the case of flow, this 
leads to an optimal n* above which the transport is less efficient. For current, the typical decrease 
in the length of the connecting paths for large n compensates for the effect of the bottlenecks. We 
also derive an expression for the average flow as a function of n under the common limitation that 
transport takes place between specific pairs of sources and sinks. 
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Transport processes, such as electrical current, diffusion, and flow, are fundamental in 
physics, chemistry, and biology. Transport properties depend critically on the structure 
of the medium, and have been studied for a large variety of geometries [1]. Of current 
interest are situations where the transport occurs on a network. For example, information is 
transferred over computer or social networks, vehicles traverse transportation networks, and 
electrical current flows in power-grid networks. Therefore, understanding of mechanisms to 
increase transport effectiveness is of great importance. 

Transport properties usually have been investigated in the context of transport between 
a single pair comprising one source and one sink [2-8]. The quality of transport strongly 
depends on the degree (number of connections) of the source and the sink, whereas the rest 
of the network serves as an approximately resistance-free substrate for the transport process. 
Consequently, the transport was found to strongly depend on the degree distribution. 

In more realistic situations, transport takes place between many nodes simultaneously. 
For example, in peer-to-peer and other computer networks users exchange files in parallel 
over the network links. In transportation networks, vehicles travel between many sources 
and many destinations through the network infrastructure. The presence of many paral- 
lel transport processes on the same underlying network leads to interactions between the 
different deliveries and a change in network efficiency. In this article we will quantify this 
phenomenon analytically and numerically, and show how different network usage leads to 
different behaviors. We reported some preliminary results in [9]. 

We focus on the class of non-directed, non-weighted model networks and we also study 
a real network. The first model is the Erdos-Renyi (ER) network, in which each link exists 
with independent small probability p. This leads to a Poisson degree distribution [10, 11]. 
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The second model is the scale-free (SF) network, characterized by a broad, power-law degree 
distribution and was recently found to describe many natural systems [12-14]. The ensemble 
of SF networks we treat is the "configuration model," in which node degrees are drawn from 
a power-law distribution (see below), and then open links are connected [15]. We also 
compute the distribution of flows in a real network, the internet [16]. 

We consider a transport process between two randomly chosen, non-overlapping sets 
(sources and sinks) of nodes of size n each, where 1 < n < N/2, N is the total number of 
nodes. We focus on three explicit forms of transport, described below. 

(i) Maximum flow (henceforth denoted flow) between the sources and sinks, when each 
link has unit capacity [8,9,16-19]. For non- weighted networks the flow is equivalent to the 
total number of disjoint paths (i.e. paths that do not share any edge) that connect the sources 
and sinks. Therefore, it quantifies any flow which does not deteriorate with distance, such 
as flow of frictionless fluids, traffic flow and information flow in communication networks. 

(ii) Electrical current in the network when the sources are short circuited into a unit 
electrical potential, and the sinks to the ground (assuming each link is a unit resistor). 
Electrical current has special significance since it also describes any general transport process 
in which the transport efficiency decreases with the length of the path (due to increase of 
resistance) . In addition, the current is equal to the probability of a random walker starting 
at any of the sources to escape to any of the sinks [20] . 

(Hi) Maximum multi- commodity (MC) flow, where the sources and sinks form ordered 
pairs, so the flow is directed from a given source to a specific sink, and not to any other 
sink [17,21] (here the network is directed and the sources and sinks may overlap). This 
describes direct communication between users in computer networks, or traffic of supplies 
over road networks. 

p-3 



S. Carmi et al. 



Our goal is to study the dependence of the total transport on the number of sources/sinks 
in the three transport forms. We denote the flow, electrical current, and MC flow by F, I, 
and F MC , respectively. In the case of a single pair, it was shown [3,9] that the transport 
between a source and a sink with degrees k\ and fc 2 is approximately: F n= \{k\ 1 k'i) « 
F^{(ki,k2) ~ min{fci,/c 2 } for dense enough networks, and I n =\(k\, fe) ~ c , where 
c < 1 is a fitting parameter. These equations state that the transport is dominated by the 
degrees of the involved nodes, and the rest of the network is practically a perfect conductor. 
Using the degree distribution: P{k) — (k) k jk\ for ER networks with average degree 

(k), and P{k) ~ fc~ 7 , k > m for SF networks with degree exponent 7 and minimum degree 
m, we can find the distribution of flow or current. 

Do transport properties change in the case of more than one pair? For a small number of 
sources/sinks n, we expect no significant difference. The backbone of the network remains 
an almost perfect conductor, but the degree k\ has to be replaced with the total number 
of links emanating from the sources, and similarly for the sinks [9]. Because it is assumed 
that n is small, we neglect the possibility of internal links inside each set. Define the sum of 
n degrees z = Y^i=i For ER networks, Pz(z) = e~ n ^ (n {k)) z /z\, a Poisson with mean 
n (k). For SF networks with 2 < 7 < 3, write P(s) for s — > 1, the generating function of 
P(k) for large k, as P(s) ~ 1 + A(l - s) + B(l - s^- 1 + O ((1 - sf). Raising P(s) to the 
power of n yields Pz(s) ~ 1 + nA(l — s) + nB(l — s) 7_1 + O ((1 — s) 2 ), and thus for large z 

P z (z) - z-~< , z > nm. (1) 

We define z\ and zi to be the sum of degrees of the n nodes in the sources and sinks, 
respectively. Using these definitions, F = min{zi, z 2 }, and / = c ^ 1 ^ . For the case of MC 
flow, the pairs are independent and the total flow is the sum of the flow of n independent 
pairs. 
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The pdfs corresponding are therefore 
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where in Eq. (4) kj.i and kj j2 are the degrees of jth source and sink, respectively. 
For ER networks we obtain a closed form formula for the flow pdf [9] 
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where j(a, x) and T(a) are the lower incomplete and complete gamma functions, respec- 
tively. An interesting quantity to study in a real transport system is the total average flow 
per source/sink F(n)/n = [^2 F FQ n (F)] /n, since it represents the overall transport effi- 
ciency of the system. For ER networks we calculate F(n)/n using Eq. (5), and the results 
are compared with simulations in Fig. 1. The theoretical prediction is in good agreement 
with the simulations for small values of n. For larger n, the transport is less efficient than 
predicted, since interactions between the paths begin to appear. Some paths become bot- 
tlenecks [22] and cannot serve more than their capacity. Moreover, some links are wasted 
as they connect nodes within the same set. In ER networks, the probability of having no 
intra-set links is (1 - n/N) n ^ w exp[-n 2 (k) /N}. Therefore already when n « (N/ (fc}) 1/2 , 
we expect some deviations, as is confirmed by our simulations shown in Fig. 1. 
For SF networks, we approximate the sum in (2) with an integral: 



poo 

$„(F) - F~ 7 J F'-">dF' - F-^-V. 



(6) 



A similar result holds for the current /. This is demonstrated in Fig. 2(a) for the internet 
at the Autonomous Systems (AS) level [16]. For MC flow, the minimum of the degrees 
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of each pair has a power-law distribution with exponent (27 — 1). From Eq. (1), the tail 
distribution of the sum remains a power-law with the same exponent, and therefore also in 
this case $„(F MC ) ~ [F MC ] ~ (27 ~ 1) . 

To evaluate the transport for the regime n » 1, we use a different approach. Let us 
concentrate on the case of flow in ER networks. We condition the total flow on the length 
of the path connecting a source and a sink, n 2 paths of length one (direct link between a 
source and a sink) are possible, and each exists with probability p = (k) /(N — 1). Denoting 
by Fg the average flow that goes through paths of length £, we have F\ = n 2 p. 

Paths of length two involve one intermediate node. If the intermediate node i is connected 
to n s (i) sources and n t (i) sinks, the flow it can channel is n m i n (i) = min{n s (z), n t (i)}. Since 
each edge exists with independent probability p, n s and n t are binomial variables with 
parameters (n,p). The probability for n m ; n to take the value m is given by 



P„ min (m) =2 



[P b {m)]\ (7) 



where Pb(j) = — p) n J since n s and n t are binomials. For large n and small p the 

binomial variable can be approximated by a Poisson distribution, for which Eq. (7) reduces 
to 

o (np) m e-"P f ^np) (npT£^\ 

The average of n m ; n is (n m i n ) = J2m=o m ^imi»( m )- Since there are (N — 2n) possibilities of 
choosing the intermediate node, F 2 = (N—2n) (n m i n ). Note that approximating F w Fi+F 2 
becomes accurate asn^ N/2, where there are only very few intermediate nodes and thus 
a very small probability for a long path. 

Paths of length three involve two intermediate nodes and their average number is more 
difficult to compute. However, by a mapping onto a matching problem we are able to provide 
lower and upper bounds for F 3 (see Appendix). In Fig. 3(a), we plot F{n) obtained with 
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the upper bound, neglecting flows of higher order F^F^,..., and find agreement with the 
simulations. 

An interesting outcome of the above calculation is an immediate result for the electrical 
current. Since all links have unit resistance, a path of length I has total resistance t so 
I = F1/I + F2/2 + F3/3+... . Our simulation results agree with our theory, as shown 
in Fig. 3(b). While it is not in general correct that in SF networks each edge exists with 
independent probability p, applying the same approach for SF networks results in a good 
qualitative agreement with the simulations — but not as good as for ER networks (Fig. 2(b)). 

Focusing attention on the transport per source/sink, F/n and I/n, we observe that while 
the flow F/n decreases with n, the current I/n increases (Fig. 4). The decrease in the case 
of the flow is intuitively clear. As more paths become "bottlenecks", the total number of 
paths connecting the sources and sinks decreases. However, when increasing n, the paths 
that exist have shorter lengths since the probability for direct or almost direct link between 
the sources and sinks is higher. While this effect does not influence the flow, in the case of 
electrical current it reduces the resistivity of the paths and increases the total current [9]. 
This is a fundamental difference between the two forms of transport, which our analysis 
reveals. 

For MC flow, paths cannot become shorter once a pair of a source and a sink is added, 
since the transport takes place between specific pairs [21] (Fig. 5(a)). Therefore, we expect 
that for some n* , the network will saturate and will not be able to carry any more flow. In 
other words, not only F MC /n will decrease, but F MC itself will not grow. 

To develop a theory for the average MC flow in ER networks, we look at the pairs of 
sources and sinks as if they are added one at a time. As the pair n is added, we assume the 
paths connecting the previous n — 1 pairs remain unchanged, such that the new pair can 
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connect only through these network links not already in use. We also assume the used links 
are randomly spread over the network and denote the average effective degree of the unused 
part as k n . Under these assumptions, the new pair will be connected, on average, with a 
number of paths that is the minimum of two degrees of average k n (see above). Thus, the 
total MC flow can be approximated as 

n-l 

F^(n) « ( 9 ) 

n'=0 

where = J2f =0 ^3^~J\ — ( ^rfj) ~ ^2j\ ) * s tne avera g e 01 the minimum of two degrees 
drawn from a Poisson distribution with average k, as in Eqs. (5) and (8). 

To complete the derivation, we need to find the average effective degree k n . This is 
straightforward to calculate, if we assume that to optimize the total flow (since we look 
at maximum flow), the transport between each new pair uses only shortest paths. Recall- 
ing that in ER networks with average degree (k) the average shortest path is of length 
log N/ log (k) [11], we can write a recursion equation for the evolution of k n 

log N 

k n +i = k n - y u(fc„)— — , k = (k). (10) 

N log k n 

Evaluating (9) and (10), we find agreement with simulations of the exact MC flow [21] 
(Fig. 5(b)). Note also, that this formula for the MC flow is an improvement over the result 
obtained with the small- n assumption, Eq. (2), as shown in the inset of Fig. 5(b). 

For large n such that k n is small, the new pair is no longer guaranteed to be connected. 
To find the value of n* , above which additional sources and sinks cannot communicate, we 
use the result of percolation theory that the network becomes fragmented when the average 
degree decreases below one [11]. Thus n* satisfies k n - — 1. Since substituting k n * = 1 in Eq. 
(10) does not yield a closed form formula, we bound n* by assuming the number of paths 
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connecting the nth pair u(fc„) satisfies 1 < fi{k n ) < k n . Eq. (10) splits into two inequalities 

k xl > k -k l ° gN (11) 

k n+ l > kn *n Nlogkn ^ 



N\ogk n 

Approximating k n +\ — k n « setting fc„ =0 = (&) an d solving the two differential inequal- 
ities, we obtain 

(logfc„) 2 > (log(fc)) 2 -2^n (12) 
(k n \ogk n -k n ) < ((fc)log(fc)-(fc))-^n. 

Substituting k n * = 1, the resulting bounds for n* are 

^<n*-^<(*> log <*>-<*> + !. d3) 

From (13), n* — 0(N/ log N), i.e., the maximal number of sources/sinks pairs that can 
communicate is of the order of N/ log TV. 

Due to the long computation time of the exact MC flow, only small system sizes could 
be considered in the simulations, thereby leading to a finite size effect that obscures the 
percolation transition in a way that the precise point where the network saturates cannot be 
observed (Fig. 5(b)). The continuous increase in the MC flow for n > n* in our simulations 
is expected since for finite networks, even when k n < 1 some nodes are still connected [23]. 
Moreover, the probability of a pair of nodes to belong to the same small (non-giant) cluster 
cannot be neglected for finite systems. 

For SF networks, the degree exponent 7 plays a significant role. For 2 < 7 < 3, there is 
no percolation threshold [24], and we expect more sources/sinks to be able to communicate 
in comparison to ER networks. For 7 > 3, a percolation threshold exists, and we expect 
behavior qualitatively similar to that of ER networks. However because of the limitation on 
computation time, we leave it as a conjecture. 
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In conclusion, we investigate the efficiency of transport in complex networks when many 
sources and sinks are communicating simultaneously. We obtain analytical results for the 
total electrical current and flow when the number of sources/sinks is very small. For a 
large number of sources/sinks, we derive approximate expressions for the total transport in 
ER networks by looking at the lengths of the paths used for the transport, and identify a 
fundamental difference between the behavior of flow and current. For multi-commodity flow, 
we calculate the mean value of the flow in ER networks. We also argue that in scale-free 
networks more sources/sinks can communicate because of the lack of a percolation threshold. 
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Appendix: Average number of paths of length three. To calculate the average 
number of paths of length three that connect the sources and sinks in ER networks, we note 
that these paths take the following form: 

Source — > IntcrmcdiatcNodcl — » IntcrmcdiateNode2 — ► Sink. (14) 

Since the pair of intermediate nodes can be connected by at most one link, the flow through 
this pair (in that specific direction) is either one or zero. To find out whether the path (14) 
is available, we must first classify each intermediate node i based on its number of links to 
the sources n s (i) and to the sinks n t (i)- Three classes are possible. 

1. n s (i) = n t {i). In this case all the links that connect i to the sources and sinks are 
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used in paths of length two, F 2 = n s (i) = n t (i), and i cannot be used for any F( with 
i > 2. 

2. n s (i) > n t (i). Node i uses n t (i) links in F 2 , and has n s (i) — n t (i) links free to use in 
Ft, I > 2. Thus, i can serve as an IntermediateNodel in the path (14). We call the 
set of such nodes 1\ , and denote its size by \I\ \ . 

3. n t {i) > n s (i). Node i uses n s (i) links in F 2 , and has n t (i) — n s (i) links free to use in 
Ft, I > 2. Thus, i can serve as an IntermediateN ode2 in the path (14). We call the 
set of such nodes 2 2 , and denote its size by \1 2 \. 

Each node in T\ or X 2 has at least one free link to use in F 3 . One has only to connect as 
many nodes in X\ to as many other nodes in Z 2 , such that the number of connections of a 
node in X\ will not exceed its value of n s — n t and that the number of connections of a node 
in 1 2 will not exceed its value of n t — n s . For a given node in 1\, denote the number of 
its spare links n s — nt as s\, and define s 2 similarly for a node in X 2 . The maximum flow 
problem is thus reduced to a generalized bipartite matching problem. This is illustrated in 
Fig. 6. Due of symmetry, = (\1 2 \) = and (si) = (s 2 ) = (s). In addition, these 

quantities can be calculated. The probability for an intermediate node to have n s > n t is: 

P(n s >n t ) = P 6 (1)A(0) + A(2)[A(0) + A(1)] + ... 



and therefore = (N - 2n)P(n s > n t ) = {N -2n) 1 - J27= [ p b{i)f /2 = The 

average number of spare links (si) = (s 2 ) = (s) is 

n 

(s) = (si) = ■ P{{n s -n t ) = i, , given n s > n t } 

i=l 

= y 8 . p|(n r nt) ; 8} (i6) 

^ P(n s >n t ) V ; 
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As a first approximation, we assume (s) = 1, to return to a regular bipartite matching 
problem. A recent theorem for ER bipartite networks has proved that a network with 
minimal degree at least 2 has a perfect matching with high probability [25]. Therefore, a 
lower bound for the matching size will be the 2-core of the bipartite network (consisting of the 
nodes in I\ and J 2 ), which is given by x where x — 1 — e _/3 (l +/3), and /3 is the solution 
of jj^ = 1 — [26]. Since each such matching contributes one unit of flow of length three, 
the total matching size x (|X|) is our lower bound for F3. Neglecting flows of higher orders 
F4, F 5 , we obtain a lower bound for the average flow: F = n 2 p+ (N — 2n) (n m i n ) +x {\T\). 
This bound becomes exact in the limit of large n, where there are very few intermediate 
nodes and thus a very small probability for a long path. 

An upper bound for the flow can be obtained by assuming that each node in the bipartite 
graph is able to match (on average) the minimum between its degree (in the bipartite 
network) and its number of spare links s. This overestimation of the flow of length three 
happens to compensate for neglecting flows of longer lengths, and in most cases agrees with 
simulations (see Fig. 3). 
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Fig. 1: Optimal number of sources/sinks. Average flow per source/sink F/(n (k)) vs. the number 
of sources/sinks n. Symbols represent simulation results (N = 4096, (k) — 8, 16; average is taken 
over many realizations of the network and many randomly chosen pairs), while lines represent the 
theory based on the small-n assumption (Eq. (5)). For n < sj N/ (k) (indicated with an arrow), 
the flow per source/sink always increases, as predicted by the theory. However, there is an optimal 
point beyond which the flow decreases. 
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Fig. 2: Flow in SF networks, (a) Probability distribution of the flow $ n (F) vs. F, for n — 1, 3, 5, 10 
(symbols), for the network of the Internet as of 2007 [16], which is approximately scale-free with 
degree exponent 7 ps 2.5. Since F is at least mn, a normalized power-law takes the form & n (F) ~ 
n 27-2j ? -(2 7 -i) -phus, we diving the vertical axis by n 27-2 to make all curves collapse. Theoretical 
slope for all n's, —(27 — 1) = —4 is indicated by the straight line, (b) Average flow F / (k) vs. the 
number of sources/sinks n for random SF networks. Simulation results (iV = 4096, 7 = 2.5,m = 2), 
are shown in circles, lines represent theoretical curves. For the theory we used the upper bound for 
paths of lengths up to three (see main text and Appendix) . 
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Fig. 3: Transport in ER networks, (a) Average flow F / (k) vs. the number of sources/sinks n. 
Simulation results (iV = 4096, {k) = 8, 16), are shown in symbols, lines represent theoretical curves. 
For the theory we used the upper bound for paths of lengths up to three (see main text and 
Appendix), (b) Same as (a) for the average electrical current / (N = 1024, (k) = 4,8). 
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Fig. 4: Transport normalized by the number of sources/sinks. Average flow (panel (a)) per 
source/sink F/(n{k)), and average electrical current per source/sink I/(n(k)) (panel (b)), vs. 
the number of sources/sinks n. Note the major difference between the flow and the current in the 
behavior of the transport per source/sink: while F/n decreases with n, I/n increases. 
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Fig. 5: Multi-commodity flow in networks, (a) A schematic illustrating the fundamental difference 
between flow and MC flow with respect to path lengths. The sources are (A, B) and the sinks are 
(A',B'). While for flow any source can connect to any sink, for MC flow A must connect to A' 
and B must connect to B', even at the cost of using longer paths, (b) MC flow F MC vs. n for ER 
networks with N = 128 and (k) — 3, 4, 5, 6. Symbols correspond to simulations and solid lines to 
Eqs. (9) and (10) (calculated up to n* , see text). Inset: for (k) = 3, we compare the simulation 
(circles) and theory (Eqs. (9) and (10), solid line) with the small-n approximation F MC (n) — nF™S[ 
(dashed line). Indeed, the small-n approximation overestimates the MC flow for large n, since it 
does not take into account the decrease in the average effective degree. 
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Fig. 6: A schematic of the bipartite network induced during the calculation of F3 (Appendix). After 
taking into account F\ and F2, we discard all direct links between the sources and the sinks, as 
well as all intermediate nodes which have n s = rit and thus all of their links to the sources and 
sinks are exploited in F2. This leaves us with two sets: nodes in T\ which have spare links to the 
sources, and nodes in T2 which have spare links to the sinks. Only these spare links are drawn here, 
as well as the links that connect nodes in Xi to nodes in I2 • Links annotated with an arrow can 
carry one unit of flow, in the direction indicated, such that F3 = 4 in this example. Note that one 
intermediate node can channel more than one unit of flow, by being matched with more than one 
intermediate node from the other set. 
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